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(54) Networking method 

(57) This invention provides dynamic balance of the 
traffic among data processing devices interconnecting 
networks and thereby improve the networking perform- 
ance. For network traffic flowing between a first network 
and a second network, the traffic is distributed among 



the data processing devices that act as routers accord- 
ing to the traffic amount. An algorithm for balancing the 
traffic is used to select appropriate data processing 
devices as routers. 
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Description 

Background of the Invention 

The present invention relates to the interconnection 
among data networks and more spedfically to traffic 
load balancing, for instance, among multiple routers or 
among multiple interfaces in a parallel processing sys- 
tem (devices). 

Computer networks are usually connected to other 
computer networks and the connection between the 
computer networks forms internets. Connection 
between two given networks is implemented by one or 
more data processing devices (see D.E. Comer, "Inter- 
networking with TCP/I P. "volume 1. Prentice Hall, 1991). 

These data processing devices include, but are not 
limited to. routers, gateways and switches, and will be 
generally and interchangeably referred to as routers or 
gateways in this specification. 

An example shown in Figure 1 represents a plurality 
of data processing devices (3a, 3b. 3c. 3d. 3e, 4a. 4b) 
and three networks (1. 2, 5). The data processing 
devices 3a, 3b act as routers between network 1 and 
network 2. The data processing device 3c acts as a 
router between networks 2 and 5. The data processing 
device 4b acts as a router between networks 1 and 5. 
The data processing devices are connected to the net- 
works by network interfaces (30a, 30b. 30c, 30d, 30e, 
3la. 31b. 31c. 40b. 41a and 41b). These network inter- 
faces (30a, 30b, 30c, 30d, 30e, 31a, 31b, 31c. 42b, 41a 
and 41b) are identified by physical addresses and also 
by network addresses. 

On various kinds of networks, the ARP protocol 
(Address Resolution Protocol) (see "Internet Engineer- 
ing Task Force RFC 826") is used to correlate physical 
and network addresses. Pfiysical and network 
addresses hereafter are given by a notation consisting 
of the interface number of Figure 1 , a dash, and a suffix 
"P" or "N" indicating that the address is a physical 
address or a network address. 

When a data processing device in one network 
communicates with a data processing device in another 
network, the communication between the two data 
processing devices is done by using one or more rout- 
ers (data processing devices) between one network and 
the other network, to transfer a communication mes- 
sage (packet) from one network to the other network. 
Generally, the correlation between a network (target 
network) to which a packet (communication) is trans- 
ferred to and a router that transfers the packet to the tar- 
get network is shown in a routing table of the data 
processing devices (see "Internetworking with TCP/IP" 
cited above). This method for indicating the route is. 
hereafter, referred to as an "explicit routing table setup". 
In Figure 1 . data processing devices 3a. 3b. 3c. 3d and 
3e have routing table 32a, 32b, 32c, 32d and 32e. 
respectively Figures 2a - 2e show an example of routing 
tables in the explicit routing setup for routes which trans- 



fer packets from the data processing devices (3a, 3b, 
3c, 3d, 3e) connected to the network 2 of Figure 1 to the 
network 1. Routing tables {32a, 32b. 32c. 32d, 32e) 
each have an entry (321a, 321b, 321c, 321d, 321e) rep- 

5 resenting a target networK an entry (322a, 322b. 322c. 
322d, 322e) representing a next hop address of the tar- 
get network, and a flag (323a. 323b, 323c, 323d, 323e). 
The flag may have the values "interface" or "gateway". 
When the value of the flag Is "Interface", the next hop 

10 address means the address of the network interface 
which is directly connected to the target network, in 
case the data processing device in question is directly 
connected to the target network. When the value of the 
flag is "gateway", the next hop address means the 

75 address of a router which transfers packets to the target 
network. This value of the flag is used in case a data 
processing device in question is not connected to the 
target network 

In the routing tables of the data processing devices 

20 3a, 3b. the network addresses (31a-N, 31b-N) of their 
respective interfaces to the network 1 are shown as the 
next hops (322a. 322b), and the value of the flag is 
"interface" (323a. 323b). The routing tables of the data 
processing devices 3c, 3d, 3e give 30a-N as the next 

25 hop (322c. 322d. 322e) and the value of the flag is 
"gateway" (323c. 323d. 323e). thus showing that the 
data processing device 3a is a router to network 1 . 

Other methods can be used to interconnect two or 
more networks. Two of these methods (Proxy ARP. 

30 OSPF protocol) are explained below. 

Proxy ARP (where "ARP" is the Address Resolution 
Protocol) is a method for making routers transparent in 
communication between two or more networks (see 
RFC 1027), by making one or more routers in the net- 

35 works act as proxies. On communications from one net- 
work to another networK the routers reply ARP 
requests on the former network querying a network 
addresses in the later networK then receive communi- 
cations on the former network addressed to the later 

40 network and route them to the later network Thus, 
these routers transparently bridge two or more networks 
(refer to "Intemetworking with TCP/IP" cited above). To 
this end, the correlation between the physical 
addresses and the network addresses needs to be set 

45 Up. 

Figure 3 shows an example of such a setup (proxy 
ARP setup), in which the correlation between network 
addresses 711 and physical addresses 712 is set as 
special entries on the ARP cache 71. In this example, 

50 the router 3a acts as a proxy for communication flowing 
from the network 1 to the network 2. A "public" flag 713 
indicates that the entry should be used to answer ARP 
queries. In this example, any of ARP queries to the net- 
work addresses 30C-N, 30d-N and 30e-N in the network 

55 2 will be answered by 31a-R Figure 3 shows an exam- 
ple of this setup accomplished in the ARP cache of the 
data processing device 3a. This setup can be imple- 
mented in the ARP cache of any of the data processing 
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devices 3a, 3b, 4a and 4b connected to the network 1 . 

The proxy ARP setup as implemented above and 
the explicit routing table setup are both performed by 
the administrator of each data processing device. This 
means that these setups are static, i.e., once performed s 
they remain the same and can only be changed by man- 
ual intervention by the administrator. Hence, when there 
is a malfunction or when a new router is installed, these 
setups must be changed manually. 

A method of changing the routing table dynamically io 
according to changes in the network is provided by the 
OSPF protocol (see RFC 1245. 1246 and 1247). In this 
case, the routers exchange routing information and 
change their routing tables according to this informa- 
tion. 15 

The basic algorithm of the OSPF protocol is shown 
in Figure 4. The data processing device broadcasts a 
message including the networks it can reach and the 
distances to these networks determined by the number 
of hops (step 822), and also receives such messages 20 
from other data processing devices (step 823). When 
the route changes (step 824), each router calculates the 
shortest path from itself to each of the networks (step 
825) and sets its routing table according to the paths 
(step 826). 25 

Figure 5 shows an example of the use of the OSPF 
in the networks of Figure 1 . In this figure, all the data 
processing devices (3a. 3b. 3c. 3d. 3e, 4a, 4b) inter- 
change data by using the OSPF protocol and thus have 
a control add-on 91 for executing the OSPF basic algo- so 
rithm. Alternatively, the OSPF can be used only in a 
subset of the networks 1 . 2, 5 or in a subset of the data 
processing devices (3a. 3b. 3c. 3d. 3e, 4a. 4b). or in 
both subsets. 

In a special case, interconnected networks include 35 
not only parallel processing devices but also massively 
parallel processing devices and workstation clusters. 
These parallel processing devices contain a plurality of 
nodes that are interconnected by networks. Examples 
of such machines are Fujitu's AP3000, IBM's RS/6000 40 
SP, and Digital Corp.'s Tru-Cluster. The case of a paral- 
lel processing device is shown in Figure 1. in which the 
data processing devices connected to the second net- 
work 2 are the nodes of the parallel processing device 
6. In this configuration, the parallel processing device 45 
has multiple interfaces for other networks to improve the 
reliability and the networking performance. The data 
processing devices (4a, 4b) of other networks (1,5) are 
mainly clients which access the services provided by 
the parallel processing device 6. so 

Summary of the Invention 

In the above examples (in the case of using Fig. 2 
and Fig. 3), all communication between the network 1 55 
and the data processing devices (3c. 3d, 3e) pass 
through only the data processing device 3a. Therefore, 
the data processing device 3a is a potential bottleneck 



for this communication. This bottleneck can be allevi- 
ated by assigning a part of the communication to the 
router 3a (the data processing device 3a) and by 
assigning other part of the communication to the router 
3b, so that the traffic (communication) passes through 
two routers. Herein, the data processing device is a nor- 
mal computer which has a processing unit, a memory, a 
disK a cache, an interface for the network, and so on, 
and an operating system; and an application program 
stored in the memory and the disk are executed by the 
processing unit. However, the network traffic changes 
with time, and if most of the network traffic flows through 
the data processing device assigned as one of the rout- 
ers at a certain time, the bottleneck persists. Hence, 
balance of the traffic among multiple routers must be 
controlled dynamically, rather than statically, in order to 
be effective. 

The explicit routing table setup and the Proxy ARP 
setup are static and thus can provide static traffic bal- 
ancing, but not dynamic traffic balancing. 

The OSPF protocol definition (RFC 1247) states 
that "when there exist several equivalent routes to a 
destination, traffic is distributed equally among them." 
IXJt does not mention the method of distributing the traf- 
fic. The OSPF protocol leaves the distribution of the traf- 
fic to the execution of the routing algorithm. Therefore, 
although in the OSPF the route changes dynamically, 
the OSPF by itself cannot provide dynamic traffic bal- 
ancing. 

Data processing devices which execute user pro- 
grams such as personal computers and workstations, 
are often used as routers. Because the routing gener- 
ates processing load, the processing performance of 
such data processing devices is decreased by the 
processing load created by the routing. Hence, when 
selecting a route so as to balance the network traffic, it 
is necessary to additionally consider the processing 
load to improve the networking performance and the 
processing throughput. As in the network traffic, the 
processing load changes with time and thus the route 
must be changed dynamically with the processing load. 
As described earlier, the explicit routing table setup and 
the Proxy ARP setup are static and therefore cannot 
provide such dynamic traffic balancing. The OSPF pro- 
tocol selects the routers that give the shortest path to 
the target network but this selection does not consider 
the processing load of the routers. Therefore, the OSPF 
protocol cannot provide dynamic traffic balancing 
according to the processing load of the routers. 

It is desirable that the routing be changed in the 
event of a network failure to provide reliability. Such a 
feature is provided by the OSPF and is desirable also for 
Proxy ARP However, because the Proxy ARP setup is 
static, it cannot provide such a dynamic traffic change. 

The problem of traffic balancing among a plurality 
of routers between two networks is solved by choosing 
for each data processing device, a router which trans- 
fers the network traffic of this data processing device, 
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based on the network traffic of each data processing 
device. Next, the routing tables of the data processing 
devices, the Proxy ARP association between physical 
and network addresses, and the routes to be used by 
the OSPF protocol are set by the above selection. This 5 
procedure is cyclically repeated to provide dynamic traf- 
fic balancing. 

The problem of the processing load in traffic bal- 
ancing is solved by using an equivalent network traffic, 
which is appropriately converted from the processing w 
load of the router, in the above procedure of selecting 
the routers. 

The problem of reliability of the Proxy ARP is solved 
by executing the above traffic balancing procedure and 
by not distributing the network traffic to those routers 15 
whose normal operation is prevented by some malfunc- 
tion. Furthermore, when a router failure or a router 
recovery is detected, the routes should also be 
selected. 

20 

Brief Description of the Drawings 

Figure 1 is an example of a network in which this 
invention is embodied. 

Rgure 2a is an explanatory diagram of a routing 25 
table setup of a data processing device 3a in Figure 1 . 

Rgure 2b is an explanatory diagram of a routing 
table setup of a data processing device 3b in Figure 1 . 

Rgure 2c is an explanatory diagram of a routing 
table setup of a data processing device 3c in Figure 1 . 30 

Figure 2d is an explanatory diagram of a routing 
table setup of a data processing device 3d in Figure 1 . 

Rgure 2e is an explanatory diagram of a routing 
table setup of a data processing device 3e in Figure 1 . 

Figure 3 is an explanatory diagram of an ARP 35 
cache setup for Proxy ARP. 

Figure 4 is a flowchart of the basic operation of the 
OSPF algorithm. 

Figure 5 is a block diagram of a control add-on for 
the OSPF 40 

Figure 6 is an explanatory diagram of a global rout- 
ing table. 

Figure 7 is a flowchart of an algorithm for selecting 
routers. 

Figure 8 is a diagram correlating the processing 45 
load and the network traffic. 

Rgure 9 is a flowchart of an algorithm for distribut- 
ing the traffic. 

Rgure 1 0 is a flowchart of a procedure for balanc- 
ing the traffic when OSPF is not used. 50 

Figure 11 is a block diagram of an embodiment that 
does not use the OSPF 

Rgure 12 is an explanatory diagram of a routing 
table setup for balancing traffic. 

Rgure 13 is an explanatory diagram of an ARP 55 
cache setup for distributing the traffic. 

Figure 14a is an explanatory diagram of a routing 
table setup of a data processing device 3a in an inter- 



face failure. 

Rgure 1 4b is an explanatory diagram of a routing 
table setup of a data processing device 3c in an inter- 
face failure. 

Rgure 1 4c is an explanatory diagram of a routing 
table setup of a data processing device 3d in an inter- 
face failure. 

Figure 14d is an explanatory diagram of a routing 
table setup of a data processing device 3e in an inter- 
face failure. 

Rgure 15 is an explanatory diagram of an ARP 
cache setup in an interlace failure. 

Rgure 16a is an explanatory diagram of a routing 
table setup of a data processing device 3c in multiple 
router failures. 

Rgure 16b is an explanatory diagram of a routing 
table setup of a data processing device 3d in multiple 
router failures. 

Rgure 16c is an explanatory diagram of a routing 
table setup of a data processing device 3e in multiple 
router failures. 

Rgure 1 7 is a flowchart of a procedure executed by 
an OSPF master device. 

Rgure 18 is a flowchart of a detailed procedure 
executed by an OSPF master device. 

Rgure 19 is a flowchart of a procedure executed by 
an OSPF slave device. 

Rgure 20 is a block diagram showing an embodi- 
ment when an OSPF is only used. 

Rgure 21 is a block diagram showing an embodi- 
ment when an OSPF is used with the Proxy ARP. 

Detailed Description of the Preferred Emt>odiment 

The whole process of traffic balancing includes the 
following three steps: (1) choosing (selecting) the rout- 
ers, (2) calculating the distribution of traffic among these 
routers, and (3) changing the route according to the dis- 
tribution, in that order. These three steps are described 
in this order and some examples are given in the expla- 
nation of the three steps. All the following explanations 
concerns a single network hereinafter referred to as 
"the network in question." Other networks are hence- 
forth referred to as "target networks." 

The explicit routing table setup and the OSPF are 
used to set the routes for communication from the net- 
work in question to the target network The Proxy ARP 
is used to set the routes for communication from the tar- 
get network to the network in question. Hence, when the 
data processing devices connected to the second net- 
work 2 are nodes of the parallel processing device 6, 
the explicit routing table setup and the OSPF are used 
to set the routes for communication from the parallel 
processing device 6 to the clients 4a. 4b. The Proxy 
ARP is used to set the routes for communication in the 
reverse direction. 
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(1) Selection of routers 

Selection of routers is made to provide reliability. 
This is done by selecting normally operating routers 
from among all possible routers to the target network, s 
This selection process is done in OSPF. In the case of 
the explicit routing setup and the Proxy ARR the follow- 
ing procedure is used First, the network administrator 
has to prepare a global routing tatrfe for the network in 
question. 

Figure 6 shows an exanple 73 of a global routing 
table for the network 2. i.e.. the network in question. For 
each target network 731, this table shows a router (ID) 
732 to this network. For each router, this table shows a 
physical address 733 of the interface to the target net- 
work, a network address 734 w^ the network in question, 
the next hop 735 to the target network after this router, 
and a priority 736 of the roote This priority is set by the 
network administrator accord*^ to the rxjnrtxsr of hops 
up to the target network and accordirig to trie fretwork 
processing capability. Hence. tf>e t>ea roui# will be 
selected preferentially. In Figi#e 6 prorfty is i when the 
number of hop is 1 and priority ft 2 wh#n tr^ number of 
hops are 2. 

The global routing table 73 m Figure 6 ts used as an 
input to the algorithm for selecting routers n Figi^e 7. At 
a first step (step 812). the algorithm reads aB entries in 
the global routing table (73) m a memory At the next 
step (step 813), the algorithm verifies the status of all 
routers (732), i.e., if the routers (732) are active or not, 
and if the indicated network interlaces (733. 734. 735) 
are active or not. At the next step (step 814). the algo- 
rithm removes all router entries (732. 733. 734. 735, 
736) from the table in the memory for which the routers 
or interfaces are not active. At the next step (step 815), 
the algorithm selects router entries (732. 733. 734. 735, 
736) with the lowest value of priority (736) and elimi- 
nates other router entries (732. 733. 734. 735. 736) 
from the table in the memory. At step 81 5, tor each net- 
work, the data processing devices that are to operate as 
routers to the target network 731 appear in the column 
732. 

(2) Calculating the distribution of traffic among the rout- 
ers 

Once the routers have been chosen, the routes can 
be selected. First, the network administrator has to pre- 
pare for the network in question a list of the data 
processing devices involved in the traffic balancing. Fig- 
ure 8 shows an example of such a list for the network 2. 
The list of the data processing devices 72 includes the 
identification code (ID) 721 for discriminating all data 
processing devices for which traffic balancing is applied 
to and the factors a(k) 722 for correlating the processing 
load of this device and the routing load. The factors a(k) 
will be described below in more detail. 

The algorithm for the calculation (84) of the routes 



to provide traffic balancing is shown in Figure 9. Herein. 
T is an index of a data processing device that does not 
operate as a router, "j" is an index of the target network 
(731 in Figure 6). and "k" is an index of a data process- 
ing device that acts as a router (732 in Figure 6). The 
traffic (i. j) is the traffic between the data processing 
device i and the network j. A load (k) ts the processing 
load of a data processing device, and it is equal to or 
larger than zero. The load (k) increases with the 
processing load of the data processing device when this 
data processing device can execute user programs, and 
when it cannot, is equal to zero. The algorithm also uses 
the value of route (k) representing the amount of traffic 
(i, j) assigned to the router k. The algorithm basically 
distributes the traffic (i, j) among the routes (k) in such a 
way that the values of the routes (k) among different 
routers k are balanced. 

The algorithm for selecting a route is given in Figure 

9. 

At the first step (step 842), the algorithm obtains the 
total traffic (i, j) and the load (k) from other data process- 
ing devices. 

At the next step (step 843), the algorithm initializes 
the value of the route (k) to a(k) x load (k) (route (k) = 
a(k) X load(k)). where a(k) (722 in Figure 8) is taken 
from the data processing device list (72). Hence, the 
processing load of the router is incorporated into the 
calculation as a networking toad. The "constant a(k)" 
provides the tradeoff between the network traffic bal- 
ancing and the processing load balancing among the 
routers. Lower overall values of a(k) give priority to the 
network traffic balancing and higher overall values give 
priority to the processing load balancing. The relative 
values of a(k) represent the performance of a data 
processing device, wherein lower values of a(k) repre- 
sent higher performance of the data processing devices 
(i.e.. a lighter processing load is produced by routing the 
same amount of network traffic). 

At the next step (step 844). the algorithm calculates 
route(k) = route(k) + traffic (i. j) for alt k in which all the 
traffic between the networks can flow only through this 
router k. All the values of traffic (i, j) used at this stage 
(step 844) are set to zero. 

At the next step (step 845). the algorithm obtains i 
arxi j so that the traffic (i, j) that is selected is the highest 
value of traffic (i, j) which is left. 

At the next step (step 846). the algorithm finds k 
that has the smallest value of route (k) among the rout- 
ers for the target network j. This selection means that 
the network traffic between the network j and the data 
processing device i is to pass through the router k. 

At the next step (step 847), the traffic is assigned to 
the router k found in step 846. i.e.. route (k) = route (k) 
+ traffic (i. j) for the i and j selected in step 845, whose 
traffic (i. j) is then set to zero. 

At the final step (step 848), if there is any non-zero 
value of the traffic (i. j). the algorithm returns to the step 
(step 845). 
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(3) Changing the route according to the distribution 

Once the correspondence between the data 
processing devices and the routers is obtained, the cor- 
responding routes are set. The process of setting the 
routes varies depending on whether the OSPF Is used 
or not. 

When the OSPF is not used, the whole process of 
traffic balancing (i.e., three steps of (1) selecting rout- 
ers, (2) calculating the distribution of traffic among these 
routers, and (3) changing the route according to the dis- 
tribution) is shown in Figure 10. After selecting the 
router (step 81) and traffic distribution (step 84). the 
algorithm executes an explicit routing table setup (step 
852) and Proxy ARP setup (step 853). The process is 
periodically repeated to provide dynamic load balanc- 
ing. 

The setup for routes for the explicit route setup 
(852) is done as follows. The route for the target net- 
work (731 in Figure 6). which is set in the data process- 
ing device to be used as a router, is taken from the 
column of the next hop (735 in Figure 6) in the global 
routing table 73 in Figure 6. In this column 735, the next 
hop corresponds to the above mentioned router. In this 
case, the flag 323 is set to "interface/ The routes for 
other data processing devices and for the data process- 
ing devices that can work as routers but whose network 
interfaces for the target network is inactive, are taken 
from the column of network 734. In this column 734. the 
network conresponds to the router assigned to the data 
processing device at step 847. In this case, the flag is 
set to "gateway." The routes are not set for the inactive 
data processing devices or for the network in question 
whose network interfaces are not active. 

The setup (853) for routes for the Proxy ARP is as 
follows. The network address of the interface of a data 
processing device is related to the physical address of 
the interface of the router selected for this data process- 
ing device. The physical address is taken from the phys- 
ical address column (733) of the glok>al routing table 73. 
If the data processing device with the Proxy ARP link- 
age is not operational, the Proxy ARP linkage should be 
set in another data processing device connected to the 
target network (731). The Proxy ARP setup is only per- 
formed when the target network (731) is adjacent to the 
network in question. When the physical address entry 
(733) of the global routing table 73 is blank, the inter- 
connection using the Proxy ARP Is not possible. Hence 
the Proxy ARP setup is not performed. When the ARP 
linkage changes, the data processing device having this 
linkage can broadcast an ARP reply having a new ARP 
linkage. In this case, other data processing devices in 
the target network (731) receive this reply. The other 
data processing devices compare the linkage contained 
in the reply with the content of their own ARP cache. 
Then, if any change is found, the other data processing 
devices change the linkage in the ARP cache to the link- 
age contained in the reply, and the other data process- 



ing devices transmit the traffic to the new physical 
address. 

Therefore, if an ARP reply is broadcast when the 
linkage changes, the new linkage is reflected on the 

5 data processing devices, realizing the dynamic traffic 
balance of the network traffic among routers. When the 
ARP reply is not broadcast, the new linkage is reflected 
on the data processing devices that send ARP queries 
to the target network (731) later 

10 In any of the data processing device (3, 4) that exe- 
cute user programs, the algorithm of Figure 10 can be 
executed by running a control add-on which uses fiard- 
ware and/or software. 

Figure 11 shows a preferred embodiment of this 

15 invention. A control add-on (92) is installed in the same 
data processing device in which an ARP cache (71) is 
provided, to facilitate the setup of the ARP cache. Alter- 
natively, the control add-on (92) can be located in any of 
the data processing devices (3a, 3b, 3c. 3d, 3e, 4a and 

20 4b). In the case of a parallel processing device (6), a 
preferred embodiment for the control add-on (92) is in 
one of the nodes (3a, 3b, 3c, 3d and 3e) of the parallel 
processing device. This is because special features of 
the operating system of the parallel processing device 

25 (6) can be used to gather data from other nodes and 
execute the setup in other nodes. 

The values of the traffic (i, j) and the load (k) can be 
taken from a data processing device in which the control 
add-on (92) is located by a system call. The network 

30 traffic and the traffic (i, j) of other data processing 
devices can be obtained, for exanrtple, by the Simple 
Network r\/lanagement Protocol (SNMP see RFC 1 157). 
The processing load of other data processing devices 
can be obtained by, for instance, the nwho protocol or 

35 the Internet Systat Sen^ice. The routes and the Proxy 
ARP linkage can be changed by a system call in a data 
processing device in which the control add-on (92) is 
running and by the SNMP protocol in other data 
processing devices. 

40 An example of the algorithm of Figure 10 applied to 
the examples of Figure 1 and Figure 6 is as follows. It is 
assumed that the network traffic associated with the 
network 1 of the data processing devices 3c, 3d. 3e is 1 
M byte/sec for each device and the traffic (i . j) is the net- 

45 work traffic measured at M byte/sec. It is also assumed 
that the data processing devices 3a, 3b, 3c and their 
interfaces operate without malfunction, that the data 
processing devices have the ability to execute user 
process, that the data processing devices 3a and 3c are 

50 not executing user processes and the data processing 
device 3b is executing one user process, and that the 
load (k) in a data processing device k is measured by 
the number of user processes in the data processing 
device k. 

55 The algorithm (81) for selecting routers reads the 
global routing table 73 at the first step (step 81 2) and, at 
the next step (step 813). verifies that the data process- 
ing devices 3a. 3b, 3c and their interfaces operate with- 
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out malfunction. At the next step (step 814), the entries 
of these data processing devices (3a. 3b, 3c) are not 
eliminated from the table as they (3a, 3b, 3c) are active. 
At the step 815. the data processing devices 3a and 3b 
are selected because they have the smallest value of s 
priority. Hence, the data processing devices 3a and 3b 
are selected as routers between the network 1 and the 
network 2. 

Next, at the first step (step 842), the algorithm (84) 
for balancing the traffic obtains the processing loads of to 
the data processing devices 3a and 3b and the network 
traffic between the network 1 and the data processing 
devices 3c, 3d, 3e. At step 843, the route (3a) is 
assigned 0 and the route (3b) is assigned 1 .5. At step 
844, no traffic is assigned to the network 1 . is 

Next, the algorithm repeats the steps 845, 846 and 
847 three times and assigns the traffic to the routers. 
The data processing devices 3c. 3d, 3e are assumed to 
have been selected in that order at step 845. Thus, in 
the first repetition, the data processing device 3c is 20 
assigned to the router 3a and the route (3a) is updated 
to 1 (= (traffic = 1) + (route (3a) = 0)). In the second rep- 
etition, the data processing device 3d also is assigned 
to the router 3a and the route {3a) is updated to 2 (= 
(traffic = 1) + (route(3a) = 1)). In the third repetition, the 2S 
data processing device 3e is assigned to the router 3b 
and the route (3b) is updated to 2.5 (= (traffic = 1) + 
(route(3b) = 1.5)). Therefore, the network traffic of the 
data processing devices 3c. 3d, 3e is distributed 
between the data processing devices 3a. 3b, because 30 
the data processing device 3b has a heavier processing 
load, greater traffic is given to the data processing 
device 3a. The route to and from the data processing 
device 3e is changed to the router 3b in tNs way 

The routing table of the data processing device 3e 35 
after the explicit routing table setup (step 852) is exe- 
cuted and is shown in Figure 1 2, The ARP cache after 
executing the Proxy ARP setup (step 853) is shown in 
Figure 13. The entry encircled by a dotted line is a 
changed one. 40 

Two examples for explaining reliability will be 
described below. 

The first example is one in which the network inter- 
face of the data processing device 3a (31a) in the net- 
work 2 fails, but data processing device 3a continues 45 
operating. In the algorithm (81 in Figure 7) for selecting 
routers, the failure of the data processing device 3a is 
detected at step 813 and the entry of this failed router is 
eliminated at step 814. The entry of the data processing 
device 3b is selected at step 81 5 because it has the low- 50 
est priority value. Thus, only the data processing device 
3b is used as the router. Next, in the traffic distribution 
algorithm (84 in Figure 9), the traffic of the data process- 
ing devices 3a, 3c. 3d. 3e are assigned to the router 3b. 
The routing tables (32a. 32c. 32d. 32e) of the data 55 
processing devices 3a, 3c, 3d. 3e after the explicit rout- 
ing table setup (step 852) is performed are shown in 
Figs. 14a - 14d. The ARP cache (71) after the Proxy 



ARP setup (step 853) is shown in Figure 15. The entry 
encircled by a dotted line is a changed one. In this 
example, the communication among the data process- 
ing devices (3a. 3b. 3c, 3d and 3e) in the second net- 
work is not affected by failures of the network interfaces. 
Similarly, in the case of the parallel processing device 
(6). the communication between the parallel processing 
device and the clients (4a, 4b) continues even after the 
route has been changed. 

Another example concerning reliability is a case 
that both data processing devices 3a and 3b totally fail. 
In this case, the algorithm (81) for selecting routers 
detects the failures of the data processing devices 3a 
and. 3b at step 813. And the entries of these routers are 
eliminated at step 814. Thus, only the entry of the data 
processing device 3c remains and is selected as the 
router. Next, in the traffic distribution algorithm (84). the 
traffic of the data processing devices 3d and 3e is 
assigned to the router 3c. The routing tables (32c, 32d. 
32e) of the data processing devices 3c, 3d, 3e after the 
explicit routing table setup (852) are shown in Figure 
16a - 16c. All of the entries in the ARP cache (71) have 
been eliminated because the physical address (733) 
corresporxiing to the router 3c are not set. In this exam- 
ple, the communication between the data processing 
devices (3c, 3d. 3e) that are still active in the network in 
question and the data processing devices (4a. 4b) in the 
target network are interrupted. This is because only the 
route from the network 2 to the network 1 is set and the 
route from the network 1 to the network 2 is not set. 
However, if the same system is also applied to the net- 
work 1 , the traffic from the network 1 to the network 2 
can flow through the data processing device 4b, and 
therefore the communication between the networks 1 
and 2 is not interrupted. The same also holds for the 
parallel processing device (6). 

When the OSPF is used in the network in question, 
two major differences arise. First, the router is selected 
by the OSPF processing, and therefore the OSPF is 
used in place of the router selection algorithm (81 ). Sec- 
orxl, while the router exchange routes by the OSPF 
operate in such a way as to perform distribution, the 
router selection algorithm (81) and the traffic distribution 
algorithm (84) should be executed in a centralized fash- 
ion, to avoid inconsistencies among the routes selected 
by various data processing devices, and to prevent the 
processing load that may be produced if all of the data 
processing devices query the processing load and net- 
working traffic of one another. 

Thus, the OSPF (Open System Path First) is 
extended as follows. One of the data processing 
devices that exchanges data by the OSPF (hereinafter 
referred to as "master") is responsible for centralized 
tasks such as that of selecting the routes and broad- 
casting these routes to other data processing devices 
that execute the OSPF Other data processing devices 
(hereinafter referred to as "slaves") set these routes 
accordingly A detailed description of the procedure per- 
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formed by the master and the slaves is as follows. 

The basic algorithm of the master is shown in Fig- 
ure 1 7. Step 822 to step 826 are shown in Figure 4. The 
OSPF basic algorithm is extended by an additional step 
(step 831) and executed periodically This step 831 is 
detailed in Figure 18. In Figure 18. first, a traffic balance 
calculation algorithm (step 34) is executed. The step 84 
is explained in Figure 9. Next, the routes in the master 
data processing device are set (step 8311). Then, the 
selected routes are broadcast to the slave data 
processing devices (step 8312). Next, the routes are set 
in the data processing devices that do not exchange 
data by OSPF (step 8313). Finally, when Proxy ARP is 
used along with OSPF, the Proxy ARP correspondence 
is set (step 853). The basic algorithm for the slave is 
shown in Figure 19. Step 822 to step 826 are shown in 
Figure 4. Here the basic OSPF a>gorrthm is extended by 
two additional steps (step 832 and step 633) and also 
periodically executed. In the first stop of t>e two addi- 
tional steps, the slave data processing device recefves 
the selected routes broadcast by tne master ana at tt^e 
next step (step 833), sets its own routes accordng»y tt 
should be noted that the special •IC^la^ge of data is 
performed independently of the OSPF protocca* and tnus 
there is no need to change the OSPF protocol In ttie 
master and slave, when the router to be used cttanges 
as a result of the status change in tt>e router, the master 
and slave should select a route artxtrarity from those 
available until a traffic balance calculaton resuft is 
obtained and broadcast. Otherwise, the previous route, 
if still valid, should be retained. This route then changes 
depending on the result of the traffc distr*xjtion algo- 
rithm (84). 

Two preferred embodiments of control add-on for 
the master 93 and slave 94 are shown in Figure 20 and 
21. In the embodiment of Figure 20. the OSPF is used 
in all networks (1 , 2, 5). In Figure 20. the control add-on 
of the master 93 is located in the data processing device 
3a. Alternatively it may be located any of the data 
processing devices (3a. 3b. 3c. 3d. 3e. 4a. 4b) that 
exchange data by the OSPF In the emboc* ment of Fig- 
ure 21. the OSPF is used in one or nx>re networks (in 
the figure, network 2), and Proxy ARP is used to set 
routes for the traffic coming from one or more networks 
(in the figure, network 1) adjacent to the network that 
uses the OSPF The control add-on of the master 93 is 
located in the same data processing device in which the 
ARP cache (71) is located, in order to facilitate the setup 
of the ARP cache (71). Alternatively the master may be 
located in any network using the OSPF. and the ARP 
cache (71) may be located in any data processing 
device (3a. 3b. 4a, 4b) in the target network. 

The three examples for the explicit routing table 
setup and the Proxy ARP operate in a similar way to the 
case of the OSPF. in that the same routes may be 
selected by the algorithm (84) similarly to the algorithm 
(81) which selects routers through the shortest path 
selection (825), and in that the routes (32a. 32b, 32c. 



BNSDCX:iD: <EP 08581 B9A2_I_> 



32d. 32e) set in the routing table of the data processing 
devices in the network of interest and the linkage set in 
the ARP cache (71) may be the same. The same also 
applies to the parallel processing device (6). 
5 This invention has five advantages. 

First, good networking performance is provided by 
dynamically balancing the network traffic among routers 
between two networks. 

Second, the invention takes the routing traffic from 
10 a data processing device having a higher processing 
load, thereby utilizing the data processing devices more 
effectively and increasing the job processing capability. 

Third, In the case of failure of one or more routers, 
the invention transparently changes the routing of the 
75 Proxy ARP to improve the networking reliability. 

Fourth, this invention has compatibility with the 
existing network protocols and thus these protocols 
need not be changed. In the case of the explicit route 
setup and the Proxy ARP, the operating system of the 
20 data processing devices does not need any change. 
When there are two or more equivalent paths when the 
invention is applied to the OSPF protocol, only the con- 
trol add-on for executing the OSPF protocol needs to be 
changed. 

25 Fifth, in the case of parallel processing devices, 
because not all nodes (data processing devices) require 
external network interfaces to communicate with other 
netvw)rks efficiently, the cost of parallel processing 
devices can be reduced. 

30 While preferred embodiments have been set forth 
with specific details, further embodiments, modifications 
and variations are contemplated according to the 
broader aspects of the present invention, all as deter- 
mined by the spirit and scope of the following claims. 

35 

Claims 

1 . In a first network and a second networK each inter- 
connecting a plurality of data processing devices 
40 and interconnected by a subset of the data 
processing devices, a networking method to select 
routes to balance network traffic among the data 
processing devices that interconnect the first and 
second networks, comprising: 

45 

a first step of obtaining an amount of network 
traffic flow between the data processing 
devices of the first and second networks; and 
a second step of selecting for each data 

50 processing device routes according to the 

amount of network traffic flow. In a way that net- 
work traffic flow between the data processing 
devices interconnecting the first and second 
networks is distributed among the data 

55 processing devices of the second network, so 

that the network traffic flow among the data 
processing devices interconnecting the first 
and second networks is balanced. 
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2. A networking method according to claim 1. further 
comprising: 



3. A networking method according to claim 2, wherein 

the second step uses the processing load of the 75 
data processing devices which are converted to 
network traffic. 

4. A networking method according to claim 1 , wherein 

an Address Resolution Protocol (ARP) and a Proxy 20 
ARP method are used in the first network and 
dynamically balances the network traffic from the 
first network to the second network among the data 
processing devices interconnecting the first and 
secorxj networks, the networking method further 25 
comprising: 



5. A networking method according to claim 2. wherein 
an Address Resolution Protocol (ARP) and a Proxy 
ARP method are used in the first network and which 
dynamically balances the network traffic from the 
first network to the second network among the data 
processing devices interconnecting the first and 
second networks, the networking method further 
comprising: 



16 

6. A networking method according to claim 4, wherein 
at the fourth step, the network traffic of the data 
processing devices from the first network to the 
second network is distributed among only data 
processing devices interconnecting the first and 
second networks which have no malfunction that 
prevents the transfer of the network traffic between 
the first and second networks. 

7. A networking method according to claim 5. wherein 
at the fourth step, the network traffic of the data 
processing devices from the first network to the 
second network is distributed among only data 
processing devices interconnecting the first and 
second networks which have no malfunction that 
prevents the transfer of the network traffic between 
the first and second networks. 

8. A networking method according to claim 1 , further 
comprising: 

a step of changing routes to the first network in 
a routing table of each of the data processing 
devices of the second network to data process- 
ing devices interconnecting the first and sec- 
ond networks that are selected to act as the 
routers for the data processing devices of the 
second network, 

wherein the network traffic from the sec- 
ond network to the first network is balanced 
dynamically among the data processing device 
interconnecting the first and second networks. 

9. A networking method according to claim 2, further 
comprising: 

a step of changing routes to the first network in 
a routing table of each of the data processing 
devices of the second network to data process- 
ing devices interconnecting the first and sec- 
ond networks that are selected to act as the 
routers for the data processing devices of the 
second network, 

wherein the network traffic from the sec- 
ond network to the first network is balanced 
dynamically among the data processing device 
interconnecting the first and second networks. 

10. A networking method according to claim 1, wherein 
an Open Shortest Path First (OSPF) protocol is 
used in the second network and dynamically bal- 
ances the network traffic from the secortd to the first 
network among the data processing devices inter- 
connecting the first and second networks, the net- 
working method further comprising: 

a step of deciding routes of other data process- 
ing devices by one data processing device, 



EP 0 858 189 A2 



a third step of obtaining a processing toad of 
the data processing devices interconnecting s 
the first and second networks. 

wherein the second step uses the 
processing load of the data processing 
devices, and the routes are selected according 
to the network traffic and the processing load in 10 
such a way as to balance the network traffic 
and the processing load. 



a fourth step of correlating, by the Proxy ARP, a 
network addresses of the data processing 
de\rtces of the second network with a physical 30 
addresses of the interfaces connected to the 
first network of the data processing devices 
interconnecting the first arxi second networks, 
the data processing devices being selected as 
the routers for the data processing devices of 3S 
the second network, wherein the first, second, 
third and fourth steps are repeated. ' 



40 



45 



a fourth step of correlating, by the Proxy ARP, a 
network addresses of the data processing 
devices of the second network with a physical so 
addresses of the interfaces connected to the 
first network of the data precising devices 
interconnecting the first and second networks, 
the data processing devices being selected as 
the routers for the data processing devices of 55 
the second network, wherein the first, second, 
third and fourth steps are repeated. 
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according to the amount of network traffic flow, 
and 

a step of broadcasting tfie decided routes to 
the other data processing devices by using the 
OSPF protocol. 5 

11 . A networking method according to claim 1 , wherein 
an Open Shortest Path First (OSPF) protocol is 
used in the second network and dynamically bal- 
ances the network traffic from the second to the first io 
network among the data processing devices inter- 
connecting the first and second networks, the net- 
working method further comprising: 

a step of receiving routes broadcast by other is 
data processing devices; and 
a step of setting routes on the data processing 
devices which receive the broadcast routes, 
according to the received routes. 

20 

12. A networking method according to claim 2. wherein 
an Open Shortest Path First (OSPF) protocol is 
used in the second network and dynamically bal- 
ances the traffic to the network traffic from the sec- 
ond to the first network among the data processing 2S 
devices interconnecting the first and second net- 
works, the networking method further comprising: 

a step of deciding routes of other data process- 
ing devices by one data processing device, 30 
according to the amount of network traffic flow, 
and 

a step of broadcasting the decided routes to 
the other data processing devices by using the 
OSPF protocol. 35 

13. A networking method according to claim 2, wherein 
an Open Shortest Path First (OSPF) protocol is 
used in the second network and dynamically bal- 
ances the network traffic from the second to the first 40 
network among the data processing devices inter- 
connecting the first and second networks, the net- 
working method further comprising: 

a step of receiving routes broadcast by other 45 
data processing devices; and 
a step of setting routes on the data processing 
devices which receive the broadcast routes, 
according to the received routes. 

so 

14. A networking method according to daim 1 , wherein 
the data processing devices connected to the sec- 
ond network are nodes of a parallel processing 
device, and the first network connects the parallel 
processing device to other data processing ss 
devices. 

15. A networking method according to daim 4, wherein 



the data processing devices connected to the sec- 
ond network are nodes of a parallel processing 
device, and the first network connects the parallel 
processing device to other data processing 
devices. 

16. A networking method according to claim 8. wherein 
the data processing devices connected to the sec- 
ond network are nodes of a parallel processing 
device, and the first network connects the parallel 
processing device to other data processing 
devices. 

17. A networking method according to claim 10, 
wherein the data processing devices connected to 
the second network are nodes of a parallel process- 
ing device, and the first network connects the paral- 
lel processing device to other data processing 
devices. 

18. A networking method according to claim 11, 
wherein the data processing devices connected to 
the second network are nodes of a parallel process- 
ing device, and the first network connects the paral- 
lel processing device to other data processing 
devices. 

19. In a first network and a second networK each inter- 
connecting a plurality of data processing devices, 
both networks being interconnected by routers 
which are a subset of the data processing devices, 
a networking method on one of the data processing 
devices conrtprising: 

a step of gathering an amount of a network traf- 
fic flow among the data processing devices of 
the first network and the second network; 
a step of computing routes to provide network 
traffic balancing among the data processing 
devices which interconnect the first and the 
second networks, according to the amount of 
the network traffic flow; 

a step of setting routes on one data processing 
device based on the computed routes; and 
a step of sending the computed routes from the 
one data processing device to other data 
processing devices so that the other data 
processing devices set routes according to the 
sent routes. 

20. In a first network and a second network, each inter- 
connecting a plurality of data processing devices, 
both networks being interconnected by routers 
which are a subset of the data processing devices, 
a networking method on one of the data processing 
devices comprising: 

a first step of gathering routes from other rout- 



10 



BNSDOCiD: <EP 08581 e9A2_l_> 



19 EP0 858 189 A2 20 

ers; 

a second step of setting routes on one data 
processing device based on the gathered 
routes: 

a third step of gathering an amount of a net- s 
work traffic flow on the data processing devices 
that interconnect the first network and the sec- 
ond network; 

a fourth step of computing routes to provide 
network traffic balancing among data process- 10 
ing devices which interconnect the first and the 
second networks, according to the amount of 
the network traffic flow; 

a fifth step of setting routes on one data 
processing device, based on the computed is 
routes; and 

a sixth step of sending the computed routes to 
other data processing devices, so that the 
other data processing devices set routes 
according to the sent routes. 20 

21. A networking method according to claim 20, 
wherein said first and second steps are part of the 
OSPF protocol. 

25 
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